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SEQUENCE USSHNG 



(1) GENERAL 

(i) AHSZOOT: EKMmG, Jeffrey 
vote, carl 

(ii) HUE OF JNVENEECN: IEMEHIinaN-EEIA, DWFHDICDON-^^^ 

arauxES, h5»widii<3l heehrkdens and therbeeditc 

USES THEREOF 
(iii) NUMBER OF SEQUENCES: 13 

(iv) OggffiS PQM^ g MCEESS: 

(A) ftECEESSEE; c/o FISH & NEAVE 

(B) STREET: 1251 Avenue of the Americas 

(C) cm: New York 

(D) sraiE: New York 

(E) GDCNIRY: U.S.A. 

(F) ZIP: 10020 

(V) CCMFUIER READABLE KRfc 

(A) MEDKM TYPE: FLcppy disk 

(B) GCMEUUR: IEM PC n.iUHfitilft 

(Q crawnnc SY5HM: fchds/ms-dos 

(D) SOFTOBRE: Pafcentln Release #1.0, Vecsicn #1.25 

(vi) ORIENT AKIZCKEICN DKIA: 

(A) AmZCKECCN NCMER: 

(B) FTTTNG DA2E: 

(C) <T ASSTFTCATKN: 

(vii) HRICR AEPLTCKI1CN DMA: 

(A) AEKICKECCN NCMER: PCT/US91/04588 

(B) FILING DREE: 27-JUML991 

(vii) ICTCR ATOLICKETCN DMA: 

(A) AHETJCKEKN NUMBER: US 07/544,862 

(B) FDUNG DKEE: 27-JCN-1990 

(viii) AITCRJEY/AGENT INFCRflKEDCN: 

(A) NAME: HALEY Jr., Janes F. 

(B) OTSDKIRKrBCN NtMBER: 27,794 

(Q RETIHENCE/DOCJGEir NUMBER: EL29CEFH 



- 77 



(ix) IBUEPCMIKKKCDCN INFCRMKUDCN: 

(A) miHENE: (212) 596-9000 

(B) TFTFTOX: (212) 596-9090 
(Q TELEX: 14-6367 



(2) INFCRM3CN ICR SEQ ID NO:l: 

(i) SBBDEMCE QffiRftCgKISnGg: 

(A) UNSZH: 726 base pairs 

(B) TYPE: mUeic acid 
(CO SIRMCEINESS: double 
(D) ICPCtOCT: linear 



(ix) FEKECPE: 

(A) NfiME/KEY: CDS 

(B) IDCmCK: 1..723 



(Xi) SEQUENCE IESOOFnCN: SEQ ID ND:1: 

CIG GQS CIG GAG 033 M3 G3T 033 AGS CIC GAG GSG AG3 GST TGC CIC 48 
Leu Gly Leu Gin Gly Arg Gly Gly Arg Leu Gin Gly Arg Gly Ser Leu 
15 ID 15 

CIG CTA GCT GIG GCA G2A GOC ACT TCT GIG GIG ACC TIG TIG CJG GOS 96 
leu Leu Ala Val Ala Gly Ala Tnr Ser Lai Val Ihr Leu Leu Leu Ala 
20 25 30 

GTS OCT ATC ACT GTC GIG GCT GIG GIG GOC TEA GIG GOG GAG GAT CSG 144 
Val Pro Hb Hit Val Leu Ala Val Lsu Ala Lsu Val Pro Gin Asp Gin 
35 40 45 

QGA G3A GIG GEA AOG GSG AGS GX GBC GOC 035 GCA CPG GOC C3G C2A 192 
Gly Gly Leu Val Thr Gin Ihr Ala Asp Pro Gly Ala Gin Ala Gin Gin 
50 55 60 

QGA CDS G35 TIT GBG AAG GIG OA GAG QG GAG OCA GAA N3i GAT GDC 240 
Gly Leu Gly Rie Gin Iys Leu Pro Gin Glu Glu Pro Glu Ohr Asp leu 
65 70 75 80 

A3C GOC G33 CIC OCA GCT GOC CSC GIG A2A GOC GCT OCS GIG AAG 033 288 

Ser Pro Gly Leu Pro Ala Ala His leu lie Gly Ala Pro Leu Iys Gly 

85 90 95 

CBG Q33 CEA GOC jEGG CTG AOS AOS AAG GAA CPG QCG TTT CBS M3 MC 336 
Gin Gly Leu Gly Trp Glu Ihr Thr Iys Glu Gin Ala Ite Leu Ihr Ser 
100 105 110 

033 AOS G&G TIC TOG GAC GOC GAS GSG GIG G03 GDC 003 GAG GAC G3C 384 
GlyThrGln»eSerAspAlaGluGlyl£uAlaI^ 

115 120 125 
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130 135 140 

GSC GSC GSS OC GSC CM GSC OSC TOS C3TC MS CPS OQC MC TCT CIG 480 
GOy Gly Gly Asp ProGlnGlyArgSerV^ThrLeuArgSerSerLfu 
145 150 155 160 

■I^CnSGaSGSSGSCGCElMGSSaaSGSCACTaXG^ 528 
Tvr Am Ala Gly Gly Ala Tyr Gly Pro Gly Thr Pro Gin Hai Leu l£U 
165 170 175 



QG qqc GOC (3¥3 MS GIG ACT CCk GIG CTG GRC 0QS GCC AGS AGA CBA 
Gin Gly Ala fflu Hit Val Ihr Pro \fel leu Asp Pro Ala Arg Arg Gin 
180 185 190 



GSS 1GA 
Gly 



576 



GSS 1AC GSS OCT CTC TGS TM AOSAGCGIGGSSTICGSCGSCCIGGIG 624 
« Gly IVr Gly Pro Lai Tip iyr Ohr Ser Gl^ 

M 295 200 205 

f CTC OSS MS GSC (SG AGS GIG 1M GIC AM ATC AGT CM OX GKT 672 

O GlnD^AroAroGlyGluArgVsa^ 

IU 210 215 220 

ATC GIG GM TIC G0S MA GSS AM AOC TEC TIT GSS GOC GTS ATO GIG 720 
Mat V&l Asp Fte Ala Arg Gly Iys Thr Ifce Ete Gly Ala Val Met Val 
U 225 230 235 240 



726 



(2) 3NFCRM33CN KR SEQ ID ND:2: 

(i) SEQUENCE Q3SRMI&&SXXGS: 

(A) IfNSIH: 241 amino adds 

(B) TYPE: amino acid 
(D) TCPCXCCT: linear 

(ii) 1ELEOJLE TYEE: protein 

(XL) SEQUENCE EESdaPTECN: SEQ ID ND:2: 

Dau Gly Leu Gin Gly Arg Gly Gly Arg Lea Gin Gly Arg Gly Ser l£u 
1 5 10 15 

l£ja Leu Ala Val Ala Gly Ala Thr Ser Leu VSal Ihr leu leu Leu Ala 
20 25 30 

Vfcl Pro lie Hit Val Leu Ala Val Leu Ala leu Val Pro Gin Asp Gin 
35 40 45 

Gly Gly Leu Val Bar Glu Thr Ala Asp Pro Gly Ala Gin Ala Gin Gin 
50 55 60 
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Glyl^GlyaeGlnl^I^itoCnuCauG^iiDGlu'IhrA^jL^ 

SerEroGlylBuEtDAlaMaHislaillfiGlyAlaErolailysGly 
85 90 95 

GlnGlylaiGlyTcpGluIhrllTrlys GUi'Gln Ala Ite Uai Hir Ser 
100 105 HO 

GlyahrGlnHieSer^AlaGluGlylfiuAlalaiErocanAspGly 
115 120 125 

i^lvrTyrlfluOyrcysl^V^GlyiyrArgGly^AlaPtoIto 
130 135 I* 0 



GlyGlyGlyA^EroGliiGlyArgSer^l'IhrLeuArgSerSerlai 
145 150 1S5 lbU 

lyrArgAlaGlyGlyAlaiyrGlyPro^yOteProGlulsal^l^i 

GluGlyAlaGlulhrV^lhrProV^I^A^EroAlaA^ArgGln 
ISO I s5 190 

Gly Tyr Gly Pro Leu Trp iyr Unr Ser Val Gly Pte Gly Gly Lai Val 
195 200 205 

Glnl^AtgAi^GlyGlaAi^V^lyr\^^neSerHisEcoA^) 

210 215 220 

^V^A^iteAlaArgGlyIysTlirEteEteGlyAlaV^M^\fel 
225 230 235 240 

Gly 



(2) IOTOWKneN ICR SEQ 3D ND:3: 

(i) SEQO E NCE OffiRACEEBISnGS: 

(A) LHdH: 606 base pairs 

(B) TYPE: mclfiic acid 
(O STRANDEDNESS: rimhlP 
(D) 1CPGID3Y: linear 



(ix) FEMURE: 

(A) NBME/KEY: CDS 

(B) IDOKEKN: 1..603 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

CIG GCC TTA GIG CGC CPC GAT CSG GGA GGA CIG GTA ACG GAG ACG GGC 
I^AlaLaimitoGlnAspGlnGlyGlylm\^ahrGlulhrAla 

15 10 I s 
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CTC O0C G33 GC^ CTG GCC GAG OA G3A CDS 033 SIT CM M3 CXG CCA 96 
Asp Pro Gly Ala Gin Ala Gin Gin Gly Lai Gly Phe Gin Iys Lea Pro 
20 25 30 

<3G GAG GAG OCA GAA PCk GKT CIC A3C OCC G33 CIC OCA GCT GCC CftC 144 
Glu Glu Otu Pro Glu lhr Asp leu Ser Pro Gly Leu Pro Ala Ala His 
35 40 45 

CIC A1A G3C GCT 003 CDS AfiG Q3G CTG GQS C3A GQC TOG GAG AOS AOS 192 
Lsu He Gly Ala Pro Leu lys Gly Gin Gly l£U Gly Tip Glu lhr lhr 
50 55 60 

AAG GAA CAG GGG TIT CIG AOS AGC GG3 AOS CPG TIC TOG GAC GQC OG 240 
Iys Glu Gin Ala Hie Leu lhr Ser Gly lhr Gin Hie Ser Asp Ala Glu 
65 70 75 80 

GQS CIG GGG CIC OGG GAG GAC GQC CIC 1KT 1AC CDC TPC TGT GDC GIC 288 
Gly Leu Ala leu Pro Gin Asp Gly Leu iyr Tyr Leu Tyr cys Lsu Val 
85 90 95 

Q3C TPC 033 GQC 003 GG3 COC OCT GQC GQC GQ3 GBC OX GAG GQC GQC 336 
Gly iyr Arg Gly Arg Ala Pro Pro Gly Gly Gly Asp Pro Gin Gly Arg 
100 105 110 

TOG GIC AOS CIG 03C AGC 1CT CIG TPC 033 GGG GQ3 GQC GCC UPC 033 384 
Ser Val lhr leu Arg Ser Ser Leu iyr Arg Ala Gly Gly Ala iyr Gly 
.115 120 125 

O03 GQC PCT COC GfiG CIG CIG CIC GAG GQC GCC GAG AOG GIG ACT OCA 432 
Pro Gly lhr Pro Glu Lsu Leu Leu Glu Gly Ala Glu lhr Val lhr Pro 
130 135 140 

GIG CIG GAC 033 GOC AG3 ASA CPA G33 TPC GQ3 OCT CDC 1G3 1AC AGS 480 
Val Lsu Asp Pro Ala Arg Arg Gin Gly iyr Gly Pro Leu Trp iyr lhr 
145 150 155 160 

PGZ GIG G33 HC GQC GQC CDS GIG CPG CDC 033 AGS GQC (3*3 AGS GIG 528 
Ser Val Gly Rie Gly Gly Leu Val Gin Leu Arg Arg Gly Glu Arg Val 
165 170 175 

TPC GDC AAC A1C AGT CAC COC GAT AUG GIG GAC TIC GGG AGA GQS AfiG 576 
lyrV^AsnlleSerffisProAspMetV^AspI^ 

180 1S5 190 

PCC TIC TTT Q3S GQC GIG AIG GIG Q3S TGA 606 
lhr Ite Hie Gly Ala Val Met Val Gly 
195 200 



(2) INFCFMKI1CN FOR SEQ ID NO: 4: 

(i) SEQUENCE QRRACTFRISITCS: 

(A) IEN3IH: 201 amino acids 

(B) TYPE: amino aedd 
(D) ICKZOStf: linear 
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(ii) KXE3IE THE: protein - 
(xi) SEQUENCE CESCa gPOC K: SEQ ID NO: 4: 

Leu Ala leu Val Pro Gin AspGlnGlyGlyliaiValThrGlu'IhrAla 
l 5 10 15 

Asp Pro Gly Ala Gin Ala Gin Gin Gly leu Gly Ptie Gin Lys leu Pro 
20 25 30 

Gin Gin Gin Pro Gin Tnr Asp Leu Ser Pro Gly Leu Pro Ala Ala His 
35 40 45 

Leu lie GLy Ala Pro Leu Lys Gly Gin Gly Leu Gly Tcp GLu Tnr Tnr 
SO 55 60 

Iys Gin Gin Ala Ehe Leu lhr Ser Gly Ihr Gin Rie Ser Asp Ala Glu 
65 70 75 80 

GlyleuAlaLeuRTOGlnAspG^LeuTyrTyrleuTyrcysLeu Val, 
85 90 95 

Gly Tyr Arg Gly Arg Ala Pro Pro Gly Gly GLy Asp Pro Gin Gly Arg 
100 105 110 

SerValTflrliaaArgSerSerLmTyrArgAlaGlyG^^ 
US 120 125 

Pro Gly Ihr Pro Gin Leu Leu Leu Gin GLy Ala Glu Tar Val Tnr Pro 
130 135 140 

Val Leu Asp Pro Ala Arg Arg Gin GLy Tyr GLy Pro Leu Tcp Tyr Tnr 
145 150 155 160 

Ser Val Gly Rhe Gly GLy Leu Val Gin Leu Arg Arg Gly Gin Arg Val 
165 170 175 

Tyr val Asn He Ser His Pro Asp Mat Val Asp Roe Ala Arg Gly Lys 
180 185 190 

Thr R*2 Ete Gly Ala Val Met Val Gly 
195 200 

(2) INFOaRTTCN FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYRE: rucleic acid 
(O STRBNEEEHESS: double 
(D) TCKHXGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CCS 

(B) IOCKEKN: 1..447 
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(xi) SEQUENCE EESCSZFTECN: SEQ ID N0:5: 
OX CIG AM GX CM G3S CIA GX TGG GSG AOS AOS AM (3& CM GGG 48 

fto leu lys Gly Gin Gly leu Gly Trp Glu lhr Shr lys Gin Gin Ala 
1 5 10 IS 

TTT CTG MS MC GX ACS CM TIC IDG GM GX GAG GGG CIG GGG CEC 96 
She Leu Ohr Ser Gly Thr Gin Die Ser Asp Ala Glu Gly Leu Ala Leu 
20 25 30 

QQG CK3 GAC GX CEC TAT TOC CEC TRC TGT CZC GZC Q3C TPC OX G3C 144 
Pro Gin Aqp Gly Leu Tyr iyr leu T^r cys leu Vfed Gly Tyr Arg Gly 
35 40 45 

QQG GX OX OCT GX G3C G3S GAC OX CM GX OX TCG GTC AOS CIG 192 
Arg Ala Pro Pro Gly Gly Gly Asp Pro Gin Gly Arg Ser Val Ihr Lai 
50 55 60 

OX AX TCT CIG TAG COG QC33 GX QX GX TPC GX COG GX ACT OX 240 
Arg Ser Ser Leu Tyr Arg Ala Gly Gly Ala T/r Gly Pro Gly The Pro 
65 70 75 80 

GM CIG CDS CIC GM GX GX GAG AOS GD3 ACT OCA GOG CIG GAC OGG 288 
Glu leu Leu Leu Glu Gly Ala Glu Uir Vial Thr Pro Val Leu Asp Pro 
85 90 95 

GX MS MA CRA QX TRC G33 OCT CIC TQS TPC AOS MC GIG GX TIC 336 
Ala Arg Arg Gin Gly lyr Gly Pro leu Ttp Tyr Thr Ser Val Gly Fhe 
100 105 110 

GX QX CDS GIG CM CIC OX AOS GX GM MS GPS TPC GIC AM ATC 384 
Gly Gly Leu Val Gin leu Arg Arg Gly Glu Arg Val lyr Val Asn He 
115 120 125 

MT CM OX GKT A3G GIG GAC TIC GX MA GX AM AX TIC TTT GX 432 
SerHisProA^MsrtV^AspR^AlaArgGlylys Uir Ite fha Gly 
130 135 140 

GX GIG ADS GIG GX TGA 450 
Ala Val *fet Val Gly 

145 



(2) INFCRMTCN FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARAt :i VK1STICS: 

(A) LEN3IH: 149 amino adds 

(B) TffiE: amino acid 
(D) TCPCHXCT: linear 

(ii) KTrrrTTTK 2YFE: protein 

(xi) SEQUENCE DE9C3HP333CN: SEQ ID ND:6: 



II! 



EroLeul^GlyGlnGlyl^Glyl^GlulhratelysGluGlnAla 
1 5 10 15 

Itel^ulhrSerGlyThrGlnHESerAspAlaGluGlyLaiAlalai 
20 25 30 

ProGlnAspGlyl^iyrTyrl^TyrCysI^valGly'iyrArgGly 
35 40 45 

ArgiOaErolteGlyGlyGly^EroGlnGLyAEgSerVaiahrlfiu 
50 55 60 

AraSerSerlfiuTyrArgJOaGlyGlyAlaiyrGlyEroGlyatorieo 
65 70 75 80 

GaiiifiuLaaLsuGliiGLyAlaGbiahr V&L Thr Pro l£U Asp Pro 
85 9° 95 

A3aArgArg<^<aylyrGlyErolmTi5TyrahrSerValGly^ 
100 105 HO 

GlvGlyl^\^GlnI^/a^ArgGlyGluAi^\^Tyr^Asnne 

115 120 325 

SerHisEroAsph^V^AspEteAlaArgGlylyslhrPneRTeGly 
130 135 140 

Ala Val Mat Val Gly 

145 

(2) INFCRWUCN KR SEQ ID NO:7: 

(i) S EC P E N C E OKRACilKlSTICS: 

(A) mem: 156 base pairs 

(B) TSEE: nucleic acid 
(Q SIRMCEENESS: dafrilp 
(D) TCFTfTOCT: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) 103030): 1..156 



(xi) SECPENCE CE9CKIPEKK: SEQ ID N0:7: 

CIG G0C TDV GIG COCCAGGKr CAG GGA GGA CIG OA ACE GfiG ACG GCC 
I^Alal^V^EroGln^GlnGlyGlyl^^^GliilhrAla 

15 10 15 

GAC CCC Q3S GCA CSC Q0C C8G CAA GGA C3X5 GSG TTT CAG AAG CDS CCA 
Asp Pro Gly Ala Gin Ala Gin Gin Gly Leu Gly Hie Gin lys Isu H» 
20 25 30 
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5; a. 



CMGMG«a*(^ACAGaTCIEAGCaX 144 
GlnGlnGluEro(^ahrAspIeuSerBro<ay 

35 40 45 



CIC A3A. GGC GCT 
leu lie Gly Ala 
50 

(2) INFCRMOEN TCR SEQ ID JO:8: 

(i) SEQUENCE CHWMUHISIICS: 

(A) UN3IH: 52 amino acid s 

(B) TOEE: amino acid 
(D) TOTCEOCT: linear 

(ii) MXECULE TOFE: protein 

(XL) SEQUENCE DESOEOPnCN: SEQ ID ND:8: 

I^AlalmV^ProGlnAspGlnGlyGly^ 

15 10 15 

Asd Pro Gly ALa Gin Ala Gin Gin Gly Leu Gly Ete Gin lys l£u Pro 
20 25 30 

GiuGliiGlnProGlnTIirAspImSerlto 

35 40 45 

Leu He Gly Ala 
50 

(2) INECRMUCN FOR SEQ ID N3:9: 

(i) SEQI EN CE CHMRMHKESTTCS: 

(A) UN3IH: 17 base pairs 

(B) TOPE: rucleic acid 

(C) SI5WCEENESS: single 

(D) TCBCBXGK: linear 

(ii) MTHTTTF. TOPE: cCNA 
( iii ) HXPCHHEEarftL: NO 
(iv) WEE-SENSE: YES 



156 



(xi) SEQUENCE EESCRTPT33CN: SEQ ID N0:9: 
GTYKN33CT COTCOTC 



17 



(2) INFCTMKEKN FOR SEQ ID NO: 10: 

(i) S EQ O B CE GQRPCTEKISnCSz 

(A) LENS3H: 17 base pairs 

(B) TOPE: nacleic acid 
(Q STIRRNDEINESS: single 
(D) TCPOrrra: linear 

(ii) TOPE: cCNA 

HQOZBEEXCSL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCKEPHCN: SEQ ID N0:10: 

(2) 3NKRM3CN PGR SEQ 3D ND:11: 

(i) SEQUENCE CHRKCTERESEECS: 

(A) LENSIH: IS base pairs 

(B) TOPE: nrdeic acid 
(Q SIRfiNDEENESS: sin gle 
(D) TCPCODGY: linear 

(ii) MTmiTF TOPE: cCNA 

(iii) mtOIHfcOlCAL: NO 

(iv) ANTE-SENSE: NO 



(Xi) SEQUENCE EESCKEPTECN: SEQ 3D N0:11: 
MQ333QCAC U3S33CJG 
(2) INFCBMEEKN ICR SEQ ID NO: 12: 

(i) SEQUENCE CHBRft:iTKiSTECS: 

(A) I£NGIH: IS base pairs 

(B) TOPE: rucleic acid 

(C) SIRANDEENESS: double 

(D) TCFCSXCT: linear 

(ii) M3J5HJLE TOPE: cENA 
(iii) HffCOHEEICAL: NO 
(iv) ANTI-SENSE: NO 
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(ix) JEKEURE: 

(A) NHHE/KEf: miscfeature 

(B) nrmcN: 1..19 

(D) OTHER 1NFOWKEKN: /furr±icn= "lister 41 

(ix) FEKItHE: 

(A) NSME/KEY: xniscjfeafcure 

(B) IDCSOTCN: 1..19 

(D) OTHER INFCraAIDDCN: /note= tf Ihe 5 1 terminal 4 

nxfleotite in tte nrrplpmpntary stxand are not 
p jesa ± in this lirioer 11 



(XX) 'SEQUENCE EESOOPEKK: SEQ 3D ND:12: 

(2) INECRMTCN KR SEQ ID N0:13: 

(i) SEQ! ieute OffiRMTEKISEICS: 

(A) mUEH: 27 base pears 

(B) TYPE: rucleic add 
(Q SERMCEENESS: single 
(D) OTC8JDCT: linear 

(ii) VDLEOJLE TYPE: CCNA 

(iii). HXPOTHETICML; NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCTflPCTCN; SEQ 3D NO: 13: 

ctogigkia Gaa aoGac A gcaacaa 



27 



